# Information retrieval optimization

**MiniCOIL v1** · Qdrant · Apache-2.0 · Text Embedding · English · 564 downloads · 7 likes
MiniCOIL is a sparse, contextualized word-by-word embedding model designed for efficient semantic similarity computation.
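
As a sketch of how sparse word-level scoring works: similarity reduces to a dot product over the terms two texts share. The toy weights below are invented, and real MiniCOIL assigns each word a small vector within its vocabulary slot rather than a single scalar; this shows only the scoring shape.

```python
# Illustrative only: scoring sparse word-level embeddings.
# Real MiniCOIL vectors come from the model; these {term: weight}
# maps are made up for the sketch.
def sparse_dot(query_vec: dict[str, float], doc_vec: dict[str, float]) -> float:
    """Dot product over the terms the two sparse vectors share."""
    return sum(w * doc_vec[t] for t, w in query_vec.items() if t in doc_vec)

query = {"vector": 0.9, "search": 0.7}
doc = {"vector": 0.8, "database": 0.6, "search": 0.5}
print(sparse_dot(query, doc))  # 0.9*0.8 + 0.7*0.5 = 1.07
```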

**PLaMo Embedding 1B** · pfnet · Apache-2.0 · Text Embedding · Transformers · Japanese · 33.48k downloads · 25 likes
PLaMo-Embedding-1B is a Japanese text embedding model developed by Preferred Networks that achieves outstanding results on Japanese text embedding benchmarks.

**Reranker ModernBERT Large GooAQ BCE** · tomaarsen · Apache-2.0 · Text Embedding · English · 596 downloads · 5 likes
A cross-encoder fine-tuned from ModernBERT-large that scores text pairs, suited to text reranking and semantic search.
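
Cross-encoders of this kind score a (query, document) pair jointly rather than embedding each side separately. A minimal usage sketch with the sentence-transformers `CrossEncoder` class, assuming the model id matches the listing:

```python
from sentence_transformers import CrossEncoder

# Model id inferred from the listing above.
model = CrossEncoder("tomaarsen/reranker-ModernBERT-large-gooaq-bce")

query = "how do solar panels work"
docs = [
    "Solar panels convert sunlight into electricity using photovoltaic cells.",
    "The history of the solar calendar dates back thousands of years.",
]
# Each (query, doc) pair is scored jointly by the cross-encoder.
scores = model.predict([(query, doc) for doc in docs])
ranked = sorted(zip(scores, docs), reverse=True)  # highest score first
```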

**Reranker Pho BLAI** · truong1301 · Apache-2.0 · Large Language Model · Other · 21 downloads · 0 likes
A Vietnamese text ranking model, used primarily for Vietnamese text reranking tasks.

**Hypencoder 2-Layer** · jfkback · Apache-2.0 · Text Embedding · Transformers · English · 18 downloads · 1 like
Hypencoder is a hypernetwork-based retrieval model that pairs a text encoder with a hypernetwork (the Hypencoder), converting each query into a small neural network that outputs relevance scores for documents.

**Hypencoder 8-Layer** · jfkback · MIT · Text Embedding · Transformers · English · 18 downloads · 1 like
Hypencoder is a dual-encoder model for information retrieval, consisting of a text encoder and a hypernetwork (the Hypencoder) that converts text into small neural networks for computing relevance scores.
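
A conceptual sketch of the hypernetwork idea behind both Hypencoder variants, not the released implementation: the query is mapped to the parameters of a tiny MLP, and that MLP scores document representations. All shapes and the linear hypernetwork here are invented for illustration.

```python
import numpy as np

# Conceptual sketch only: a hypernetwork turns the query into the weights
# of a tiny "q-net", and relevance is the q-net applied to the document.
rng = np.random.default_rng(0)
d = 8                                  # toy embedding width

q_emb = rng.normal(size=d)             # query embedding from a text encoder
doc_emb = rng.normal(size=d)           # document embedding

# Hypernetwork: here just a fixed linear map from q_emb to q-net parameters.
W_hyper = rng.normal(size=(d * d + d, d))
params = W_hyper @ q_emb
W1 = params[: d * d].reshape(d, d)     # q-net hidden layer weights
w2 = params[d * d :]                   # q-net output layer weights

score = w2 @ np.maximum(W1 @ doc_emb, 0.0)   # tiny ReLU MLP as the scorer
print(float(score))
```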

**Lightblue LB Reranker 0.5B v1.0 GGUF** · DevQuasar · Large Language Model · 66 downloads · 0 likes
A lightweight text ranking model suited to information retrieval and document ranking tasks.

**ModernBERT Base MSMARCO** · joe32140 · Text Embedding · English · 4,695 downloads · 9 likes
A sentence embedding model fine-tuned from ModernBERT-base, designed for sentence similarity tasks on English text.

**Arabic Retrieval v1.0** · omarelshehy · Apache-2.0 · Text Embedding · Arabic · 366 downloads · 3 likes
A high-performance Arabic information retrieval model built on the sentence-transformers framework and optimized for the richness and complexity of Arabic.

**Arabic Reranker** · oddadmix · Text Embedding · Arabic · 14 downloads · 0 likes
A BERT-based Arabic reranking model that scores and ranks candidate texts.

**Polish Reranker BGE v2** · sdadas · Text Embedding · Transformers · Other · 549 downloads · 1 like
A reranker based on BAAI/bge-reranker-v2-m3, further fine-tuned on a large-scale dataset of Polish text pairs, with support for long contexts.

**PhoRanker** · itdainb · Apache-2.0 · Text Embedding · Transformers · Other · 4,063 downloads · 15 likes
PhoRanker is a cross-encoder for Vietnamese text ranking that classifies and ranks Vietnamese texts efficiently.

**Norwegian NLI Triplets C** · fine-tuned · Apache-2.0 · Text Embedding · Other · 24 downloads · 1 like
A Norwegian sentence embedding model fine-tuned from jina-embeddings-v2-base-en, focused on keyword document search and sentence similarity tasks.

**CrossEncoder mE5 Base mMARCO-fr** · antoinelouis · MIT · Text Embedding · French · 49 downloads · 1 like
A French cross-encoder based on multilingual-e5-base, designed for passage reranking.

**CrossEncoder CamemBERT Large mMARCO-fr** · antoinelouis · MIT · Text Embedding · French · 108 downloads · 1 like
A French cross-encoder designed for passage reranking in semantic search.

**GTE Large EN v1.5** · Alibaba-NLP · Apache-2.0 · Text Embedding · Transformers · Multilingual · 891.76k downloads · 213 likes
GTE-Large is a high-performance English text embedding model that performs strongly across a range of text similarity and classification tasks.
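
A minimal loading sketch, assuming the model id from the listing; the GTE v1.5 models ship custom modeling code, so loading them via sentence-transformers needs `trust_remote_code=True` per the model card:

```python
from sentence_transformers import SentenceTransformer

# GTE v1.5 checkpoints use custom architecture code on the Hub.
model = SentenceTransformer("Alibaba-NLP/gte-large-en-v1.5", trust_remote_code=True)

sentences = ["what is the capital of China?", "Beijing is the capital of China."]
embeddings = model.encode(sentences)   # one dense vector per sentence
print(embeddings.shape)
```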

**SPLADE v3 DistilBERT** · naver · Text Embedding · Transformers · English · 1,216 downloads · 6 likes
SPLADE-v3-DistilBERT is the DistilBERT variant of naver/splade-v3 and performs strongly on information retrieval tasks.
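
SPLADE models produce sparse, vocabulary-sized vectors from a masked-language-model head: per the SPLADE papers, token logits pass through log(1 + ReLU(·)) and are max-pooled over the sequence. A sketch with transformers, assuming the model id from the listing:

```python
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

# SPLADE-style sparse vector extraction: log(1 + ReLU(logits)),
# max-pooled over the token dimension.
model_id = "naver/splade-v3-distilbert"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForMaskedLM.from_pretrained(model_id)

enc = tok("sparse retrieval with learned term weights", return_tensors="pt")
with torch.no_grad():
    logits = model(**enc).logits                      # (1, seq_len, vocab)

weights = torch.log1p(torch.relu(logits))
weights = weights * enc["attention_mask"].unsqueeze(-1)  # ignore padding
doc_vec = weights.max(dim=1).values.squeeze(0)        # (vocab,) sparse vector

# Show the highest-weighted vocabulary terms.
nz = doc_vec.nonzero().squeeze(-1)
top = nz[doc_vec[nz].argsort(descending=True)][:10]
print([(tok.decode([i]), round(float(doc_vec[i]), 2)) for i in top])
```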

**SPLADE v3 Lexical** · naver · Text Embedding · Transformers · English · 1,184 downloads · 2 likes
SPLADE-v3-Lexical is a term-weighting variant of SPLADE that applies no query-side expansion, targeting information retrieval tasks.

**Polish Reranker Base MSE** · sdadas · Apache-2.0 · Text Embedding · Transformers · Other · 16 downloads · 0 likes
A Polish text ranking model trained by mean squared error (MSE) distillation on a dataset of 1.4 million queries and 10 million query-document text pairs.

**Polish Reranker Large RankNet** · sdadas · Apache-2.0 · Text Embedding · Transformers · Other · 337 downloads · 2 likes
A Polish text ranking model trained with the RankNet loss function on a dataset of 1.4 million queries and 10 million query-document text pairs.
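
The two Polish rerankers above differ mainly in their training losses. For the second, a minimal sketch of the RankNet pairwise loss (just the loss shape, not the sdadas training setup):

```python
import torch
import torch.nn.functional as F

# RankNet: for a pair where the first document should outrank the second,
# model P(pos above neg) = sigmoid(s_pos - s_neg) and push it toward 1.
def ranknet_loss(s_pos: torch.Tensor, s_neg: torch.Tensor) -> torch.Tensor:
    return F.binary_cross_entropy_with_logits(
        s_pos - s_neg, torch.ones_like(s_pos)
    )

s_pos = torch.tensor([2.1, 0.3])   # scores for the more relevant documents
s_neg = torch.tensor([1.0, 0.9])   # scores for the less relevant documents
print(ranknet_loss(s_pos, s_neg))
```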

**MiniLM L6 Danish Reranker** · KennethTM · MIT · Text Embedding · Other · 160 downloads · 0 likes
A lightweight Danish text ranking model adapted from the English MiniLM-L6 model, designed for Danish information retrieval tasks.

**SimCSE RetroMAE Small CS** · Seznam · Text Embedding · Transformers · Other · 309 downloads · 4 likes
A small Czech semantic embedding model, fine-tuned from RetroMAE-Small with the SimCSE objective.

**MMLW Retrieval RoBERTa Large** · sdadas · Apache-2.0 · Text Embedding · Transformers · Other · 237.90k downloads · 12 likes
MMLW ("muszę mieć lepszą wiadomość", "I must have a better message") is a neural text encoder for Polish, optimized for information retrieval tasks.

**MMLW Retrieval E5 Base** · sdadas · Apache-2.0 · Text Embedding · Transformers · Other · 144 downloads · 1 like
MMLW ("muszę mieć lepszą wiadomość") is a Polish neural text encoder optimized for information retrieval that converts queries and passages into 768-dimensional vectors.

**Ember v1** · llmrails · MIT · Text Embedding · Transformers · English · 51.52k downloads · 62 likes
Ember v1 is an embedding model built on sentence-transformers, used primarily for feature extraction and sentence similarity.

**BGE Micro** · TaylorAI · Text Embedding · Transformers · 1,799 downloads · 23 likes
bge_micro is a lightweight transformer-based model designed for efficient feature extraction and sentence similarity tasks.

**BGE Base EN v1.5 CT2** · winstxnhdw · MIT · Text Embedding · Transformers · English · 30 downloads · 0 likes
BGE Base English v1.5 is a transformer-based sentence embedding model for extracting sentence features and computing sentence similarity, here converted for CTranslate2 inference.

**CrossEncoder CamemBERT Base mMARCO-fr** · antoinelouis · MIT · Text Embedding · French · 622 downloads · 5 likes
A French cross-encoder based on CamemBERT, designed for passage reranking, with strong results on the mMARCO-fr dataset.

**BiEncoder CamemBERT Base mMARCO-fr** · antoinelouis · MIT · Text Embedding · French · 984 downloads · 9 likes
A dense single-vector bi-encoder for French, suited to semantic search tasks.
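
Unlike the cross-encoders above, a bi-encoder embeds queries and documents independently, so document vectors can be precomputed and indexed. A usage sketch with sentence-transformers, assuming the model id from the listing:

```python
from sentence_transformers import SentenceTransformer, util

# Bi-encoder retrieval: embed each side separately, compare by cosine.
model = SentenceTransformer("antoinelouis/biencoder-camembert-base-mmarcofr")

query = "Qui a peint la Joconde ?"
docs = [
    "La Joconde a été peinte par Léonard de Vinci.",
    "Paris est la capitale de la France.",
]
q_emb = model.encode(query, convert_to_tensor=True)
d_emb = model.encode(docs, convert_to_tensor=True)
scores = util.cos_sim(q_emb, d_emb)     # (1, num_docs) similarity matrix
best = int(scores.argmax())
print(docs[best], float(scores[0, best]))
```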

**Compositional BERT Base Uncased** · perceptiveshawty · Apache-2.0 · Text Embedding · Transformers · English · 20 downloads · 1 like
A sentence similarity model trained on the dedicated CompCSE dataset, suited to English text.

**Instructor Large** · hkunlp · Apache-2.0 · Text Embedding · Transformers · English · 186.12k downloads · 508 likes
INSTRUCTOR is a T5-based text embedding model that conditions embeddings on task instructions, focused on sentence similarity and text classification for English.
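
INSTRUCTOR takes an (instruction, text) pair per input, so one checkpoint can produce task-specific embeddings. A sketch using the InstructorEmbedding package's documented interface:

```python
from InstructorEmbedding import INSTRUCTOR

# Each input is [instruction, text]; the instruction steers the embedding.
model = INSTRUCTOR("hkunlp/instructor-large")

embeddings = model.encode([
    ["Represent the science sentence for retrieval:",
     "Photosynthesis converts light energy into chemical energy."],
    ["Represent the question for retrieving supporting documents:",
     "How do plants make their own food?"],
])
print(embeddings.shape)   # (2, 768)
```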

**MSMARCO DistilBERT Base TAS-B mMARCO PT 300k** · mpjan · Text Embedding · Transformers · Other · 37 downloads · 4 likes
A Portuguese sentence embedding model based on the DistilBERT architecture, optimized for semantic similarity tasks.

**MSMARCO DistilBERT Base TAS-B mMARCO PT 100k** · mpjan · Text Embedding · Transformers · Other · 44 downloads · 4 likes
A Portuguese sentence-transformer model based on DistilBERT, designed for sentence similarity and semantic search tasks.

**Cross-Encoder mMARCO German DistilBERT Base** · ml6team · Apache-2.0 · Text Embedding · German · 1,026 downloads · 3 likes
A German cross-encoder fine-tuned on the mMARCO dataset for query-passage relevance scoring.

**MonoT5 Large MSMARCO 10k** · castorini · Large Language Model · 168 downloads · 1 like
A T5-large reranker fine-tuned for 10,000 steps on the MS MARCO passage dataset; it is particularly strong zero-shot on datasets other than MS MARCO.

**MonoT5 Base MSMARCO 10k** · castorini · Large Language Model · 5,396 downloads · 14 likes
A T5-base reranker fine-tuned for 10,000 steps on the MS MARCO passage dataset, with excellent zero-shot performance.
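
The monoT5 rerankers frame relevance as text generation: the model reads "Query: … Document: … Relevant:" and the score is read off the first generated token's "true" vs "false" logits. A sketch with transformers following that recipe (libraries such as pygaggle wrap this more robustly):

```python
import torch
from transformers import T5ForConditionalGeneration, T5Tokenizer

model_id = "castorini/monot5-base-msmarco-10k"
tok = T5Tokenizer.from_pretrained(model_id)
model = T5ForConditionalGeneration.from_pretrained(model_id)

query = "what causes tides"
doc = "Tides are caused by the gravitational pull of the moon and sun."
inputs = tok(f"Query: {query} Document: {doc} Relevant:", return_tensors="pt")

# Score = probability mass on "true" vs "false" for the first decoded token.
start = torch.tensor([[model.config.decoder_start_token_id]])
with torch.no_grad():
    logits = model(**inputs, decoder_input_ids=start).logits[0, 0]
true_id = tok.encode("true")[0]
false_id = tok.encode("false")[0]
score = torch.softmax(logits[[true_id, false_id]], dim=0)[0]
print(float(score))   # probability the document is relevant
```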

**SGPT 2.7B Weighted-Mean MSMARCO SpecB BitFit** · Muennighoff · Text Embedding · 85 downloads · 3 likes
SGPT-2.7B is a sentence-transformer model that uses weighted-mean pooling, trained on the MS MARCO dataset with BitFit (bias-only fine-tuning), focused on sentence similarity tasks.
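
SGPT's weighted-mean pooling gives later tokens linearly increasing weight before averaging, on the intuition that autoregressive models accumulate meaning toward the end of a sequence. A minimal sketch of just the pooling step, on toy tensors:

```python
import torch

# Position-weighted mean pooling: token i gets weight proportional to i+1,
# with padding masked out and weights renormalized per sequence.
def weighted_mean_pool(hidden: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
    # hidden: (batch, seq, dim); mask: (batch, seq), 1.0 for real tokens
    weights = torch.arange(1, hidden.size(1) + 1, dtype=hidden.dtype)
    weights = weights.unsqueeze(0) * mask           # zero out padding
    weights = weights / weights.sum(dim=1, keepdim=True)
    return (hidden * weights.unsqueeze(-1)).sum(dim=1)

hidden = torch.randn(2, 5, 8)                       # toy hidden states
mask = torch.tensor([[1, 1, 1, 1, 0], [1, 1, 1, 0, 0]], dtype=hidden.dtype)
print(weighted_mean_pool(hidden, mask).shape)       # torch.Size([2, 8])
```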